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Abstract 

Given two sets of data which lead to a similar statistical conclusion, 
the Simpson Paradox [10] describes the tactic of combining these two 
sets and achieving the opposite conclusion. Depending upon the given 
data, this may or may not succeed. Inverse Simpson is a method 
of decomposing a given set of comparison data into two disjoint sets 
and achieving the opposite conclusion for each one. This is always 
possible; however, the statistical significance of the conclusions does 
depend upon the details of the given data. 

1 Introduction 

Anyone contemplating a statistical analysis is warned, at an early stage 
of the game, "but don't combine the statistics of monkey wrenches and 
watermelons", or the equivalent. Failure to heed this instruction - at a more 
sophisticated level, to be sure - gives rise frequently to Simpson's Paradox 
(here, in its 2-trial sequence version): if choice A is "statistically better" 
than choice B in each of two sets of trials under differing circumstances, 
then it may happen that merging the two sets of data produces the opposite 
conclusion. Consider the following specially constructed example for the 
sake of illustration: 

In Fig. 1, we pictorially represent trial sequence #1 by a solid line, trial 
sequence #2 by a dashed line; trial #1 tests drug A, Ni times, drug B, 
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Table 1: Simpson Paradox Prototype 







Trial #1 


Trial #2 


Total 


Sa = 


A successes 


60 


60 


120 


Fa^ 


A failures 


20 


140 


160 


Sb = 


B successes 


140 


20 


160 


Fb = 


B failures 


60 


60 


120 



60/80>140/200 and 60/200>20/80 
but 120/280<160/280 



N2 times, while trial ^2 reverses the number of tests. The successes S, 
and failures F arc shown for each drug in each trial sequence. If a < 6, 
so 1 — a > 1 — 5, then clearly the S/N ratio of drug A is larger than that 
of B in both trial sequences, so drug A certainly seems better. But in 
the combined trials Sa/Na = ((1 - a)Ni + hN2)/{Ni + N2) is lower than 
Sb/Nb = {{l-b)N2 + aNi)/{Ni + N2) if {l-a)Ni + bN2 < {l-b)N2 + aNi, 
or 

N.<^N,, (1.1, 
a quite feasible circumstance, so that drug A has now become inferior to B\ 



DrngA 



— '1 . — gW, 




Figure 1: Simpson Paradox Prototype 



This phenomenon is well-known and well-documented [5] [6] [7] [8] [9] [10] 
- but hope springs eternal. Only recently [1], a drug manufacturer, whose 
current potential blockbuster drug (Xinlay) failed to better a placebo in two 
clinical trials with uncorrelated protocols, proposed to a regulatory agency 
to pool the two sequences. If accepted, their drug would then outperform 
the placebo, allowing them to move forward. The regulatory agency panel 
was not unaware of the forced paradox, and denied the reinterpretation of 
the data. 
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2 Inverse Simpson 



The Simpson Paradox is data-driven. As in (1.1), it may, or may not, hold 
in a given situation. However, what we may term inverse Simpson paradox 
is a different story: can we take a long pair of data streams - say successes 
and failures with drug A, and similarly with drug B - and decompose them 
into two pairs of subsequences, each of which reverses the conclusion of the 
original pair? This can be carried out in different ways and for different 
purposes, 

a) Most directly and legitimately, it may be realized that data from two 

sources were combined for simplicity, and so there is a unique decom- 
position called for, which may indeed reverse the conclusion. This 
appears to be the case in the oft-quoted Berkeley sex discrimination 
controversy [5]. 

b) Least directly and least legitimately - but perhaps an effective strategy 
in litigation - one can ask for that decomposition that maximally 
reverses the conclusion, and then use ingenuity to characterize the 
subsets thus obtained. 

c) Putting a different spin on b), one can ask for that decomposition 
that maximally comes jointly to either conclusion, and use this as an 
investigative tool to recognize a hidden characterization of significant 
subsets of related entities. 

At first blush, inverse Simpson, in contexts b) and c), is trivially accom- 
plished. Fig. 2 illustrates the principle. 



The dotted lines refer to the assertedly pooled data, clearly indicating 
that A loses to B. The hypothetical trial 1 data is represented by solid lines, 
and since A has only successes, it is surely superior. And the dashed lines 
refer to trial 2, in which B has only failures, and so surely loses. 




Drug A 



DrugB 



Figure 2: Inverse Simpson Prototype 
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But Fig. 2 is a suspiciously extreme version of a strategy that can be 
made to look more reasonable. To put it in context, let us consider the 
well-known Berkeley sex discrimination case [5], which wc will paraphrase 
for numerical simplicity. The original data is that in one division, Sa = 41 
out of Na = 100 male applicants were admitted, a success rate of Pa = -41. 
On the other hand, Sb = 29 of Nb = 100 female applicants were admitted, 
a success rate of only Pb = .29. Clearly, it would seem that the admission 
process discriminated against females. This was not the case. In fact, 



Table 2: Simplified Berkeley Admission Data 

Dept. 1 Dept. 2 

Male Applicants 30 70 

Males Admitted 6 35 

Female Applicants 70 30 
Females Admitted 14 15 

Total Male Admissions/Apphcants 41/100=.41 
Total Female Admissions/ Applicants 29/100=. 29 



Table 2, Simplified Berkeley Admission Data, was arrived at by combining 

that of two departments, say 1 and 2. Referring to Table 2, we see that the 
success rates of males in the two departments were Pai = -2, Pa2 = -5, with 
the corresponding female success rates of Pbi = -2, Pb2 = -5. There was no 
demonstrable discrimination in either department, but "mixing watermelons 
and monkey wrenches" created very much of a statistical artifact. 

Let us proceed to a general situation. We are given Na and Pa = 
Sa/Na, Nb, and Pb = Sb/Nb for which, without loss of generality. Pa > 
Pb- We then imagine compartmentalizing the ^-pool as Na\ = oiNa, Na2 = 
(1 — a)NA, and the B-pool as A^'^i = PNb, Nb2 = (1 — P)Nb] the success 
rates are to be given via Sai = PaiNai, Sa2 = Pa2Na2, Sbi = PbiNbi, 
Sb2 = Pb2Nb2- The question then is whether a and (3 can be chosen so 
that 

P.^A^ft. ^^^^ 

Pa2 = ^ = Pb2, 

indicating no advantage to A ox B in cither case. This is trivial. Since 

Sa\ = aXNA, Sa2 = (1 - Oi)nNA, Sbi = PXNb, Sb2 = (1 - P)iJ'Nb, we 
must have 

PA = a\ + {l- a)ii 
Pb = /3A+(1-/3V 
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Figure 3: Placement of Averaging Parameters A and n 



Thus, Pa and Pb are both averages of A and /i, which therefore must he 
outside the interval (Pb, Pa) as in Fig. 3. Exphcitly, of course, we have 

a = 13= 

- A /X - A . 

Pa-X , . Pb-X ^^-^^ 
l-a= 1-/3= - 

— A fi — A 

In situations not as clear cut as the Berkeley case, we would want to invent 
a hypothetical decomposition in which e.g. A is roughly in the middle of 
the (0, Pb) interval, fi roughly in the middle of (P^,!), in order to allay 
suspicion. In the Berkeley case, we see that A = .2, = .5 do satisfy this 
criterion. 

With (2.3), we find that a suitable decomposition removes the apparent 
bias against females: no assertion can then be made. But Fig. 2 illustrates 

a proactive strategy, in which a suitable decomposition reverses the original 
assertion and appears to establish the superiority of A. What is wrong 
with the construction of Fig. 2, aside from its suspicious extreme nature? 
Nothing, but the conclusion is questionable because we have not attended to 
the statistical significance of the new assertions, a point that was emphasized 
by the FDA panel cited above. Doing so forms the substance of our ensuing 
discussion. 



3 Statistical Significance 

A prototypical situation calling for statistical assessment is this. A sequence 
of N independent Bernoulli trials - successes or failures - is carried out on 
the same object, resulting in S successes. Given e, with what probability, or 
confidence , can we claim that p, the intrinsic success probability parameter, 
satisfies 

\p - S/N\ < (3.1) 
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The standard approach is to start with the elementary result that, regarding 
5" as a random variable and defining q = 1 — p, 

Pr{\S-Np\ <N^/^e\p) 

where [ ] denotes integer part. The device then is to identify (3.2), which is 
a probability on S'-space, with a probability on p-space: 

Pri\p - S/N\ < e/ArV2|5) = p^^^s - Np\ < N^/^e\p) (3.3) 

signifying our confidence that (3.1) holds. 

The sort of information that will interest us will, however, in the con- 
text of this prototype, be more like: with what confidence, based upon the 
observed value of S, can we claim that 

p > 1/2? (3.4) 

Now, the above recipe is not readily applicable, since we are no longer ques- 
tioning a relationship between p and S that makes possible the sub rosa 
journey from S-space to p-space. But this is indeed the province of the 
Baycs approach [4] which - ignoring the controversy that continues to swirl 
around it - is what we will use. First of all, let up recall what (3.1) would 
become in a Bayesian context: we imagine joint {p, S')-space and quote the 
obvious 

Pr{p = p'\S = S') = Pr{S = S'\p = p) f{p)/Z 

I" (3.5) 
where Z= / Pr{S = S'\p = p") f{p")dp" , 



f here referring to probability density. If f{p') is the prior density on p- 
space, then 



Pr{\p - S/N\ < e/N'^/^\S = S') 

f{p')p"'r-''dp'/z (3_g) 



S'/N-e/N^/^ 



f f{p')p"'r-''dp'. 

Jo 
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But suppose we choose a uniform prior, f{p) = 1; then (3.6) becomes 

Pr{\p - S/N\ < e/iV^/^) 

/•minis' /N+e/N^/'^,N) 
Jma.x{0,S'/N-e/m/^) ^"^''^ 

Z= f p'^q'^'-^dp' = {{N + 1) {^))-\ 
Jo 

Eqs. (3.2, 3.3) and (3.7) are certainly not identical, but if we go to the large 
sample regime, i.e. the normal approximation to the binomial, then (3.2, 
3.3) aver that 



Pr(|p-5/Ar| < e/ArV2) = / e'^' ds'/V2^, (3.8) 



which, it is easy to show is identical with the large N , fixed S/N , steepest 
descent expansion [3] of (3.7) around p' = S/N. 

On the basis of the above equivalence, we now go immediately to the 
question indicated by (3.4). Using Bayes with a uniform prior, precisely as 
in (3.7), we have 



1 rl 



= 1 - Si/2(S + 1, + 1 - S)/B{S + 1, iV + 1 - 5), 

where B is the Beta function, 5^/2 the corresponding incomplete Beta func- 
tion [2]. Eq. (3.9) can also be written in the neat form 

N-S 



(1 \ ~ 



3=0 
3=0 



(3.10) 



The important point however is that this construction leads quite directly 
to evaluation of quantities such as Pr{pA > Ps), that are appropriate to the 
Simpson paradox. 
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4 Level of Significance of the Inverse Paradox 



The effect we are studying is not very subtle, and so it is sufficient to take 
a large sample limit, which strategy we adopt. However, there are several 
sample parameters, leading to the meaningful use of additional limiting op- 
erations. Consider first the prototype, Eq. (3.10); here. 



N-S 



M^) = E(T)/2^^^ (4-1) 



j=Q 

expresses the level of significance of the assertion that p > ^, and it is not 
until such an assessment is made that one can declare meaningful compar- 
isons. Let us evaluate (4.1) in the large sample limit in a familiar fashion 
that extends at once to the question of Pr{pA >Pb) relevant to the Simpson 
paradox. 

Although (4.1) is finite and explicit, its implementation for large N and 
S - while trivial numerically - is a bit complex. For this purpose, the 
expression (3.9) is more useful; it says that 



aN{S) = C'\^{l-pf-^dp/ C p^{l-pf-^dp. (4.2) 
Jq Jo 



"1/2 

'0 ' Jo 
By the large sample limit, we will mean that in which 



s = f^g _ (4 3) 

is fixed (to within N"^/^) as N —>■ oo, and we then ask for 

a{s) = lim aN{S). (4.4) 

AT— »oo 

This is obtained quite directly by a steepest descent evaluation [3] of (4.2). 
The relevant integrand is now 

/(p)=p«(i-p)"-» 

(| + ivV^.)l„p+(|_jV'/2.),„(i_rtl 



= exp 
with a maximum at 



PO 2 



l+N-'Z-'s, (4.6) 
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and a corresponding expansion starting as 



I(jp) = I{po) exp - 



{p-PoY/ 1- 



1 



N 



Hence 



a{s) = lim 



N—i-oo 



1/2 







/-2s 
e~ 
-Ari/2_2s 

-2s 



dx/ 



2N{p-pof 

-2s 
Arl/2_2s 



Afl/2_2s 



dp 



^"l^dx 



/ — J,S fOO 
e-^ l'^dx/ / e-- l''dx, 
-oo J — oo 

immediately recognizable in a normal distribution context. 
We can then proceed to the desired evaluation of 

Pr{pA > Pb\Sa, Sb, Na, Nb) = 

[f{PA,PB) 



II 



1>PA>PB>0 

Pr{SA, Sb, \pa,Pb, Na NB)]dpA dpB/ 



J j [fiPA,PB) 



1>PA>0 

l>Ps>0 



(4.7) 



(4.8) 



(4.9) 



Pr{SA, Sb\pa,Pb,Na, NB)]dpA dpB- 

This is carried out in Appendix A, where we choose Bayes with uniform 
prior on PA,PB space and process (4.9) as we did (4.2). The result is that 
for large Na,Nb, 



Pr{pA > Pb) 



Sa_Sb_ /( Sa{Na-Sa) 
Na Nb / \ N^s 



1 /"^ _1 2 

where ^(x) = / e dy 

v27r 7-0O 



Unsurprisingly, wc can obtain (4.10) as well by a version of the proba- 
bility space equivalence assertion employed in (3.3). It is only necessary to 
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consider the random variable 

^ Na Nb ^ ^ 

where Sa and Sb are binomially distributed with success probabihties pA 
and pb- Since we find at once that 



Pr{pA-pB<^-^-A\SA,SB 



(4.12) 



= ipAe^/''^ + qAf^iPBe-'"''' + qef'', 
it follows directly that 

E{^\pa,Pb) =PA-PB 

it\ \ PA ^ PB (4-13) 

Ysx{i\pA.PB) = j^^qA+j;^qB 

and then from the central limit theorem that in the limit A^^, A''^ oo, 

''K^-^-^^-^^+^l^^'^") (4.14) 

= cPi-A/ipAqA/NA+PBqB/NB)'^^) 



The same sleight of hand as in (3.3) then converts this to 

i, 

i/2\ (4.15) 



,/ fSA{NA-SA) ^SBiNB-Ss) 

and so, setting A = ^ — to (4.10), as was to be shown. 

5 Realizations of the Inverse Paradox 

Now let us make use of the result (4.10). If our initial data is characterized 
by Sa, Sb, Na + Ng = N, and Pa = Sa/Na, Pb = Sb/Nb, then the 
confidence level with which we can assert that Pa ^ Pb given by 

HN^/^Cab) 

Cab = {Pa - Pb)/(tab > ^^^^ 
2 _ Pa{1 - Pa) Pb{1 - Pb) 
Na/N Nb/N ■ 
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Our objective is to supply a decomposition into two hypothetical trials 

{Sai, Nai, 

Sbi,Nbi) and {Sa2,Na2,Sb2,Nb2) such that 

if q = (Pb, - PaOM, 1 = 1,2 
where Pai = Sai/Nai, Pbi = Sbi/Nbi 

2 ^ Pm{1-Pm) PB^{l~PBi) (5.2) 

Nm/N + NBi/N ' 
thenC^ > Ofori = 1,2. 

In fact, to be definite, we suppose that the two pairs of trials reverse the 
initial assertion at a common level of confidence 

{Pbi - Pai)/<ti = C = {Pb2 - Pa2)/o2 (5.3) 

with C > 0. To start, we need to find the restrictions on C' under which 
the required {Pai, Pa2, Pbi, Pb2) satisfying (5.2) can be found. 

The solution is direct but algebraically cumbersome, and is presented in 
detail in Appendices B and C. The conclusion of the former is that if a > 
then 

'PPa-ocPb PPA-aPs 

(5.4) 



C'<min. 

aas pcFA 
aPs - PPa aPB-/3PA \ 
acTB ' fioA J 

Since we require C > 0, this implies that 

a/ 13 > Pa/Pb > 1, P/a > Pb/Pa > 1- (5.5) 

In (5.4) and (5.5), we uniformly adopt the notation: 

if < X < 1, then x = 1 — x. (5-6) 

Eq. (5.4) is a bit involved and, even worse, contains the unknown pa- 
rameters pAi, PBi implicitly. But it can be simplified by reducing its right 
hand side and thereby strengthening the requirement on C a bit. This is 
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carried out in Appendix C, with the conclusion that, if a > /3, then 

Pa + Pb>1:C'< 2(77)1/2 ((p^/P4)2 _ i) 

PaPb 
PbPa. 

Pa + Pb <l:C' < 2(77)i/2((p^/p^)2 _ i) 



{Pa-Pb){Pb/Pa)/ 



1 



(5.7) 



{Pa-Pb){Pa/Pb)/ 



(PaPbX 
\PbPa) 



1 



where 7 = Na/N 

are sufficient to carry out the apparent reversal of ranking of A and B. 

Let us take a simple example that has been previously quoted [4] [8]. We 
will paraphrase it and use rounded off data. Hospitals A and B specialize 
in treating a certain deadly disease. Na = 1000 patients are treated at A 
and Nb = 1000 at B. Of these, Sa = 900 recover, while Sb = 800 recover, 
so that Pa = .9, Pb = .8 and Hospital A is apparently the place to go. 
In fact, one computes Cab = -05, so that this conclusion is supported at 
the .05 X (2000)1/2 = 2.24 standard deviation level. Detailed investigation 
shows that matters are not so simple. Some patients enter in otherwise good 
shape, others in poor shape. Of the former, A^^i = 900 enter hospital A, and 
870 recover; of the latter, Na2 = 100 enter and 30 recover, so Pai = .967, 
Pa2 = .3. 



Table 3: Simplified Hospital Recovery Data 



Admissions to Hospital A 
Recovered in Hospital A 
Admissions to Hospital B 
Recovered in Hospital B 



Good Shape 
900 
870 
600 
590 



Poor Shape 
100 
30 
400 
210 



Total Recovered/ Admissions in A: 900/1000=. 9 
Total Recovered/ Admissions in B: 800/1000=.8 



On the other hand, Nbi = 600 enter Hospital B in good shape and 
Sbi = 590 recover, whereas Nb2 = 400, Sb2 = 210. Thus, Pbi = .983, 
Pb2 = -55. We see that by not mixing the two classes of patients. Hospital 
B is superior for each class - at levels C[ = .038 (1.7 standard deviations) 
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and C2 = .176 (7.9 standard deviations). Simpson, or inverse Simpson, 
depending upon one's point of view, is certainly exemplified. 

Of course, the criteria as to which patients entered in good shape, which 
in poor shape, are a bit fuzzy. Given the aggregate data, the decomposition 
into the two classes could, as we have seen, been planned with the intention 
of most convincingly asserting the opposite of the conclusion from the ag- 
gregate data. If this had been done according to the prescription of (5.7), 
then with the same input data, we would have found a = .935, /3 = .738 
(not far from the a = .9, (3 = .6 corresponding to the additional data pre- 
sented) and concluded with the superiority of Hospital B at a confidence 
level corresponding to C < .107 or 4.79 standard deviations for each class 
of patients. 

6 Concluding Remarks 

The Simpson paradox, one of the simplest examples of the common misuse 
of statistics (think meta-analysis?) has received increasing attention, since 
the consequences of its use - or misuse - can be quite severe (as well as 
profitable). In the classical Simpson Paradox, the only question is whether 
or not to combine data from different sources (and trying to justify the de- 
cision to combine). What we have seen here is that the inverse Simpson 
paradox, even in its most "sophisticated" version in which mean differences 
are weighted by appropriate standard deviations, is nearly universally ap- 
plicable. This can be an effective analytical tool, but can equally well be an 
effective technique for distorting statistical data. 
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A Evaluation of (4.9) 

Choosing Bayes with a uniform prior on pa,Pb space, (4.9) becomes 
J J Pr{SA,SB\pA,PB,NA,NB)dpAdpB/ 

1>PA>PS>0 

J J Pr {SA,SB\pA,PB,NA,NB)dpAdpB 
^>Pa,Pb>0 

= J J Pb"" Qa" Qb"" dp a dpB I 

// v'jtvl^^l'cil-dpAdpB (A.l) 

= {J\%^qB''dp^ P^jt^f/dpAl 
J^'J plU'y'PAUA'dpBdpA 

= C Bp^ (Sa + 1,Fb + 1) pf q^^ dp a/ 
Jo 

B (Sb + 1,Fb + 1)B{Sa + 1,Fa + 1). 

Applying the known expansion of the incomplete Beta function [2], this 
reduces after a little algebra to 

Pr{pA>PB\SA,SB,NA,NB) 

Fa+Fb 

I ( Na+Nb+2\ 
I \ Na+1 ) ' 

or introducing S = Sa + Sb, N = Na + Nb for notational convenience, 
Pr{pA>PB\SA,SB,NA,NB) 

j=0 
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But we will go to the large sample limit defined by fixed 



s' = Nj,^/^ [Sb-^Nb] 1-j = Nb/N 



(A.4) 



as A/" — oo. We could proceed precisely as in (4.5 - 4.8), but if we imagine 

a large sample limit from the outset, the derivation is brief and standard. 
Consider drug A. A uniform prior for pA is given by the beta distribution 

/(pa) = 6(1,1; pa) 
where 6(m,n; pa) = p'a^^q'a^ / B{m,n) (A.5) 
B{rnin) = m — l!n — l!/m + n — 1! 

which, afterS'A successes in Na trials creates the posterior distribution 

h{l + SaA + Na-Sa-^Pa). (A.6) 

Drug B works the same way. It follows that 

Sa + 1 Sb + 1 



E{PA - Pb) = 



Na + 1 Nb + 1 



w t ^ {Sa + 1){Na + 1-Sa) 

^-^^^ - = [NA + mNA+2) (^■^) 

{Sb + 1){Nb + 1-Sb) 
{NB + inNB + 2) ' 

and so by the central limit theorem for large Na-, Nb, 

p. . . ,fSA Sb ,f SA{NA-SA) 

(A.8) 

where (/)(x) = f e~^^ dy. 

v27r 7-0O 

B Restrictions on C 

Eq. (5.3) itself imposes two conditions. Aside from the crucial < Pai, Pa2, 
PbiiPb2 < 1) there are just two more due to the composition conditions 
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that Sai + Sa2 = Sa, Nai + Na2 = Na, Sbi + Sb2 = Sb, Nbi + Nb2 = Nb- 
We reintroduce the notation of Section 2: 

Nai = aNA, Nbi = I3Nb (B.l) 

and hereafter uniformly adopt the notation that 

if < a; < 1, then x = l-x. (B.2) 

Thus Sai + Sa2 = Sa imphes PaiNai + Pa2Na2 = PaNa, or 

aPAi + aPA2 = Pa (B.3) 

and similarly 

PPbi + PPb2 = Pb- (B.4) 
We also append (5.3) in the form 

Pbi - Pai = C'ai 
Pb2 ~ Pa2 = C'a2, 
and solve (B.3), (B.4), (B.5) to yield 



(B.5) 



ry ry 

Pai = Ki + CVb, Pa2 = K2 C'aB, 

a — p a — p 

Pbi = Ki + CV«, Pb2 = K2 - C'aa 

a — p a — p 



(B.6) 



(B.7) 



where 

Ki = {/3Pa - aPB)/{a - (3), cJq, = aai + q:(T2, 

K2 = iaPB-pPA)/{a-p), ap = pai+Pa2. 

Eqs. (B.6), (B.7) are realizable if the requirements < Pai, Pa2, Pbi, Pb2 < 
1 are satisfied. Since we are asserting, without loss of generality, that 
Pa > PB) we of course have the condition 

Pa>Pb,Pb>Pa. (B.8) 

There arc then two cases to consider. If a > /?, it is easily seen that Ki > 0, 
K2 < 1, so that Pai, Pbi ^ 0, Pa2,Pb2 ^ 1 Sire already satisfied. The 
remaining four conditions Pai, Pbi < 1, Pa2, Pb2 > can then be gathered 
together as 

ifa > /3then 

O \ Illlll I , = , 

- \ aap ' Paa (B.9) 

(g - P)K2 {a-P)K 2 
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or, inserting (B.7), 

. fppA-apB ppA-apB 
C = mm z , = J 

aPB - PPa aPB-pPA \ 
aap ' J ' 

Similarly, 

if a < /3 then 

. faPB-PPA aPB-pPA 
C < mm , ^ , 

PPA - apB /3pA-apB \ 
Since we require C > 0, immediate consequences are that 

if a > p,then — > — > 1, — > > 
p- Pb- a - Pa- 

P Pb a Pa 

must hold. 



C Simplification of (5.4) 

The major step is the observation, from (5.2) that 



so that 



Hence, 



N / 1 1 



4 \NAi NBi 



2 N / 1 1 



uNa PNb 
2 N / 1 1 



4 \aNA PNb 
. 1 / AT N 

' - 4a \Na Nb) 
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yielding 

^^}<max(a„a2)<-(^— + — J 



max 



(C.4) 



Setting Nj\/N = 7, condition (5.4) can therefore be strengthened to 

a>l3:C'< 2(77)^/2 min(/3, a) ^2 



min 



\{PPa) -aPn), -{aPB - PPa) 
.P ^ 



(C.5) 



And in the same way, we obtain the strengthened 

a<B:C'< 2(77)^/2 niin(a,/3)^/2 



mm 



^{aPB - PPa), liPPA - aPe) 
a p 



(C.6) 



Eqs. (C.5) and (C.6) are vahd for all a,/3, and we may indeed find the 
largest feasible range for C' by maximizing their right hand sides over a and 
p. Again, to reduce complexity, let us take the special case in which: 

a>P:a/p = {Pa/Pb)\ P/a = {Pb/Pa? (C.7) 

so that 

« = [{PA/PB)\PB/PAf - [Pa/Pb?]/ 

[{PA/PB)\PB/PAf-l] (C.8) 
P = [{Pb/Pa? - m{PA/PB)\PB/PA? - 1] 
converting (C.5) and (C.6) to 

a>P:C'< 2{^-ff'^/[{PA/PB)\PB/PAf - 1] 

min[(PB/PA)' - 1, {Pa/Pb? - 1] (C.9) 
• miii{PA - PaIPb^Pb - pI/Pa). 

But {Pa - PaIPb) - (Pb - PI/Pa) = (1 - Pa - Pb){Pa + PbT/PaPb 
and {{Pb/Pa? - 1) - {{Pa/Pb? - 1) = {Pa + Pb - 1)^ (ft + ft)- 
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so it follows that in the a> (3 case, 




1 



1 



1 



1 



) 



) 



(C.IO) 



are sufficient to carry out the apparent reversal of ranking of A and B. The 
decomposition corresponding to the choice a < (3 can of course be similarly 
specialized. 
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